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1. INTRODUCTION 

The Space COntrol Labaratory Experiment (SCOLE) is a challenge for control 
engineering applications. This is a result of the system dynamics, the available mea- 
surement information, the actuator capabilities and finally the specified performance 
requirements set. 

Results on the use of Model Reference Adaptive Control have already been re- 
ported [1], In view of the necessity for rapid response, this work deals with an optimal 
control formulation, with a minimum time requirement and constrained input. 

In Section 2 a mathematical statement of the problem is presented. The time 
optimal control formulation is presented in Section 3, and the reasons that make such 
an approach not promising are discussed. As a result, a pseudo time-optimal control 
algorithm is presented in Section 4. In Section 5, the proposed approach is tested 
to see if it satisfies the design specifications, and finally in Section 6, discussion and 
suggestions for futher research are provided. 
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2. BRIEF STATEMENT OF THE PROBLEM 

The antenna-beam system to be controlled is modelled as: 

x = A-x + B -u A € ft 12xl2 ,£ € » 12x3 

Only the 12th order approximation of the distributed parameter system is considered 
here. The states are organized in the form of modes, that is x 2 ,_i position, x 2t velocity, 
i = 1,2, ..., 6. The modes are dynamically decoupled, as can be seen from the structure 
of matrix A: 

' J x 0 0 ... 

0 J 2 0 ... 

A = 

: : ••• 0 

.0 0 0 Je 

However, the coupling is introduced by matrix B where all three controls afFect 
all the modes: 

B 1 ' 

b 2 o o 0^ 

B = B, = 

: [ B ti B,2 B t 3 , 

> Bq . 

The output to be controlled is 

y = 2/1 =C p -x C p € 3? 2x12 

k V2 - 

where y u y 2 are the X and Y coordinates of the tip of the antenna, and 

/C pll 0 C pl3 0 ... C pl6 0\ 

P “lc p21 0 C p23 0 ... C p26 Q) 

The actual measured output is 

y m = C m -x C m € ft 3x12 . 
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where 

( 0 C m 11 0 C m i3 0 ... C m i6 > 

C m — 0 Cm21 0 Cfn23 0 • • • Cm26 

\ 0 C m 3i 0 C m 33 0 ... C m 36 / 

Obviously, the output matrix C p incorporates at the desired output only the position 
component states, while C m maps at the output only the velocity components of the 
modes. 

Given an initial output, such that the displacement ||y(0)|| = d 0 = 1 ft , it is 
desired to transfer the state vector to x(tf) = 0 in minimum time tj, subject to the 
constraints 

\ui\ < U i = 1,2,3 

where U = 6.25. The numerical values of matrices A, B,Cp,C m can be found in [l]. 
Starting with an initial state at time t = 0: 

Xq(0) = [—3.3013 0 -0.71870 ... 0] T 

corresponding to the initial output 

f 3.7947 'l 

2 /( 0 ) = c p • * 1 ( 0 ) = l|y( 0 )|| = 4> = 1 ft 

11.3842 J 

the output of the free response of the system is shown in fig. 1. It is obvious that the 
settling time is tj > 1000 sec. 

In order to improve the response of the system, the following design specifications 
were considered: 

• The decision should be a feedback controller. 

• If possible, explicit use of observers should be avoided. 

• The decision scheme should be implementable in an ON-LINE fashion. 
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• The design should provide a controller robust with respect to initial state and 
to parameter uncertainties. 

In the sequel, a description of the ways that the problem was attacked is presented, 
and simulation results are given and analyzed for performance evaluation. Finally 
suggestions for improvements are given. 

3. THE MINIMUM TIME PROBLEM 

The first approach towards trying to solve the above problem was to formulate it 
as the simple minimum time control problem: 

min tt 

U J 

S.t 

x = A ■ x + B ■ u 

x(0) = ar 0 x (tf) = 0 
\ui\<U i = 1,2,3 

Numerical implementation of this optimal control problem requires its transfor- 
mation to a Two Point Boundary Value Problem (TPBVP). Such a transformation 
is obtained by introducing the scalar variable 

t 

*/ 

thus transforming the problem to the equivalent 

min / z ■ dr 
« Jo 

s.t 

x' = (A • x + B • u) ■ z, (.)' = 
z' = 0 
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x(0) = x 0 


x(l) = 0 


|u,| < U i = 1,2,3 

Solution of the above problem using multiple shooting methods, was not possible due 
to numerical difficulties. The major problem is the almost marginal stablitiy of the 
system that makes necessary the use of very small sampling intervals, and thus a very 
large number of intervals and parameters. 

Another approach to solving the above problem was that of [2], which in fact 
transforms the infinite dimensional problem to a finite dimensional one. A series of 
Linear Programs have to be solved. Similarly, the large number of variables along with 
the resulting constraints that are extremely restricting, made the solution impossible. 
4. THE PSEUDO-MIMIMUM TIME PROBLEM 

All the above problems that led to numerical instabilities convinced us that find- 
ing a minimum time control with the optimal control formulation presented above, is 
a very difficult task. Its application would be even harder due to the fact that it is 
open-loop control and therefore non-robust. Additionally, minimum time control tra- 
jectories obtained for reduced order subsystems showed that the controller saturation 
levels were changing signs every 3ms making the implementation impossible. There- 
fore another approach was adopted; namely, the LQ Regulator with fixed function of 
final state [3]. Such an approach is defined as follows: 

SYSTEM 

xjt+i = A • x k + B ■ u k 


PERFORMANCE INDEX 

1 1 N-l 

J = ~ X N ■ S ■ x N - f - (xl • Q ■ Xfc + Ufc • R ■ u k ) 

L Jt=o 


S N > 0, Q > 0, R > 0 
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FINAL STATE CONSTRAINT 

C p • xn = 0 

OPTIMAL CONTROL LAW 

K k = ( B T • Sk+i ■ B + R ) -1 • B t • Sk+i ■ A Sn ■ given 

S k = A T -S k+l -(A-B-K k ) + Q 

V k = (A-B-K k ) T V k+1 V N = Cj 
P k = P k+1 - V? +1 ■ B ■ (B t ■ S k+l ■ B + R)- 1 • B t ■ l4+i Pn = 0 
K u k = ( B t • S k +i ■ B + R)- 1 ■ B t 
G k = ~(I<k - Kl • V4 +1 • P^ 1 ■ V k ) 
u k = G k - x k 

Obviously this is a state feedback controller. The measurements matrix C m con- 
tains linear combinations of the velocities of the modes. Therefore state x can be 
derived only by using an observer. An effort to avoid observer utilization is reported 
in the next section. 

The approach, described above, allows much flexibility in the selection of the 
parameters of the matrices 5, Q, R. Since the output constraint considers only the 
linear combination of the position components of the states, the state can be reduced 
by penalizing the final state, i.e by having a large S matrix. In the present set of 
experiments, 5 was 50000 * I. The selection of the R matrix directly affects the 
maximum value of the control trajectories. However since the bounds in all controls 
are the same, R was chosen diagonal with the same elements for all controls. In the 
present set of simulations R was I. Finally, the selection of Q matrix, provides some 
valid flexibility since it is obvious from the control structure that the control values 
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are directly proportional to the states. An iterative adjustment was made using 

q " E?=i max fe[ o, t/ ](x?) 

Finally, the number of steps N that define the final time tj, have to be found exper- 
imentally based on the selected strategy. 

As an example, consider again the initial state 

4 =[-3.3013 0 - 0.71870 ... 0] r 

corresponding to ||y(0)|| = 1 ft. Using the proposed controller, the system’s re- 
sponce can be shown by the trajectories of the two components of the output Vk = 
C p ■ x k that are shown in fig.2. The corresponding optimal control trajectories 
u* k i = 1,2 are shown in fig. 3 (because u 3 is of the order of 10“ 1 , it was not 
shown). Inspection shows that the output is lead to zero while the input satisfies 
the constraints. The settling final time was found to be tf = 130 sec which is 
considerably less than the settling time of t } > 1000 sec of the free system of fig.l. 
5. DESIGN SPECIFICATIONS TESTING 

The proposed approach might be considered an ON-LINE procedure, because the 
set of feedback gains G k can be computed fast. This is so, because the computation of 
Gjt’s is not iterative, but it is straightforward. It is also a feedback policy because the 
control input is given by u k = G k - x k . However this is state feedback, and not output 
feedback as required. Additionally, a problem with this approach is that although the 
output is going to be lead to zero for every initial state, the input u k is not guaranteed 
to be bounded by U. If the only considered measurements are y mk = C m -x k , (i.e linear 
combinations of velocities), then either the explicit or the implicit use of observers is 
unavoidable. 

Assuming that the desired ouput y k = C p -x k is available, an attempt to obtain an 
approximate estimate of the state, was the following: Matrix C p is not square, and 


7 



therefore an invertible submatrix C p E 3£ 2x2 of C p was sought, so that 

= K)~ l • y k 

and 

*4 = W(1) 0 ii(2) 0 ... Of 

could be used as an approximation to x*. By considering the first two modes of the 
system as dominant, matrix C' p was constructed as 

Cp = [Cp.x C pt3 ] . 

Still, this approach did not seem to work as can be seen by the following example: 
Assume that at time t = 0 the system is at state 

x 2 = [° 0 0 0 0.3018 0 56.0972 0 ... 0] r . 

corresponding to an ouput, y 0 = C p • Xq = [0.7589 2.2768] r with j | t/o 1 1 = 2.4 in. 

Obviously, the magnitude of this initial deviation is smaller than the 12 in require- 
ment, but this was done because of memory constraints of the computing system 
used. According to the above scheme, the estimated initial state corresponding to 
t/o was 

xl = ( C’ v )- X ■ t/o = [-0.6603 0 - 0.14370 . . . 0] T . 

This was the state that the controller actually “observed” at t = 0. Using xj as an 
initial estimate of the state, a set of gain matrices G*. k = l,2,...iV was obtained by 
the algorithm of section 4. N was chosen in a such a way that the input 

Uk = Gk ■ %k 


of the fictitious system: 


Xfc+i = A ■ x k + B • u k 
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are bounded by U. Since boundedness depends on the initial state, we denote the set 
of gains obtained for this fictitious system by Gjt(xJ) k = 1,2, ...N, in order to show 
that these gains were selected based on xj. 

The response of the output of the fictitious system is shown in fig. 4, and the 
inputs Ui i = 1,2,3 of the fictitious system which are always less than U = 6.25 are 
shown in figures 5,6 and 7 respectively. The final time for the fictitious system was 
found to be f/ = 13 sec. 

However, the initial state was not xj but actually Xq which was essentially un- 
known to the controller. The control gains Gjt(xo) were calculated based on the esti- 
mate Xq of the initial state, and unfortunately, the control inputs u ^ = (<?;)-'•»] 

that were actually fed to the system, exceeded by far the bound U = 6.25, while the 
actual output yk was not lead to zero. Even an increase of the final time tj from 
13 sec to tj = 100 sec at the fictitious system (which meant the calculation of a 
whole new set of gains G*(xJ) k = 1,2 , ....iV 7 ) did not give better results, as can be 
seen in figures 8 and 9 for y(l),t/(2),u(l) and u( 2) respectively. 

Finally, the robustness properties to parameters variation were tested by increas- 
ing the resonant frequencies of the system by 10%. Using as initial state the previously 
used Xq, the set of gains obtained for the nominal system for tj = 13 sec and finally 
assuming state feedback, the “perturbed” system’s output was lead to zero as shown 
in fig. 10, however the control (fig. 11) exceeded U. The responces for this perturbed 
system do not seem to be much different than these of the nominal (fig. 4,5,6). An 
increase of the final time to tj = 15 sec seems to resolve the problem since the 
output is lead to zero (fig. 12) and the control is now bounded (fig. 13). These can 
be compared to the corresponding reponses of the nominal system for tj — 15 sec 
cited at (fig. 14 and 15). 

6. DISCUSSION 

It is obvious, that the proposed scheme requires knowledge of the full system state 
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for efficient implementation. In this case the system performs acceptably since the 
output is lead to zero, while the state, because of the penalty imposed by matrix S 
is lead close enough to zero. Additionally, the properties of the control scheme with 
respect to system parameters variation are satisfactory if a larger final time is chosen 
for the nominal system. The on-line calculation of G(t) will not be a big issue if 
current parallel computation schemes are utilized [4]. 

In the case that the only available measurement is y m = C m ■ x, which is a 
linear combination of the velocities of the system, then no obvious way to guarantee 
performance exist other than use of an observer. 
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Fig.14 Output response yl and y2 of nominal system with tf=15 sec 







Fig. 15 Inputs ul and u2 of nominal system with tf=15 sec 



